Recognition of (3) party conversation using prosody and gaze

Author

  • Yosuke Matsusaka
Abstract

We have developed a recognition system that can understand multi-party conversation from combined prosody and gaze information. Multi-party conversation becomes complex because side participants generate many overlaps and interruptions, which makes it difficult to keep track of the main thread of the conversation. Gaze works as a strong cue for both clarifying and perceiving “who is talking to whom” and “who is listening to whom”, and can be used to improve the understanding of the conversational situation. We analyzed gaze behavior in conversational situations based on recordings of actual human-to-human conversation, and created a computational model to recognize the main thread of the conversation. Performance improved by up to 20 points compared with the condition that used only prosody.

Similar resources

Speaker diarization using eye-gaze information in multi-party conversations

We present a novel speaker diarization method that uses eye-gaze information in multi-party conversations. In real environments, speaker diarization or speech activity detection for each participant in the conversation is challenging because of distant talking and ambient noise. In contrast, eye-gaze information is robust against acoustic degradation, and it is presumed that eye-gaze behavior plays...

Turn-alignment using eye-gaze and speech in conversational interaction

Spoken interactions are known for accurate timing and alignment between interlocutors: turn-taking and topic flow are managed in a manner that provides conversational fluency and smooth progress of the task. This paper studies the relation between the interlocutors’ eye-gaze and spoken utterances, and describes our experiments on turn alignment. We conducted classification experiments by Suppor...

A Multiparty Multimodal Architecture for Realtime Turntaking

Many dialogue systems have been built over the years that address some subset of the many complex factors that shape the behavior of participants in a face-to-face conversation. The Ymir Turntaking Model (YTTM) is a broad computational model of conversational skills that has been in development for over a decade, continuously growing in the number of factors it addresses. In past work we have s...

The effect of adding gaze direction recognition to stabilizing exercises on pain, muscular endurance and proprioception women with chronic non-specific neck pain

Aims and background: Gaze direction recognition is one of the new treatment methods for neck pain. The positive effects of stabilization exercises on neck pain have also been confirmed in various studies. Therefore, the aim of this study was to investigate the effect of adding a gaze direction recognition program to common stabilizing exercises on neck pain intensity, muscular endurance and pro...

Quantitative analyses of Gaze Activity during Silence: Comparison between Native-Language and Second-Language Conversation

We analyze gazes during silence in multi-party conversation and compare them between conversations among native-language speakers and those among second-language speakers. The duration of gaze during silence shows a significant difference between these two conditions: gaze during silence is longer in a second-language conversation. Correlation analyses for gazes during silence and the values fro...

Publication date: 2005